Automatic methods for lexical stress assignment and syllabification

Authors

  • Steve Pearson
  • Roland Kuhn
  • Steven Fincke
  • Nick Kibre
Abstract

Improvements in automatic lexical stress assignment and syllabification can increase the quality of text-to-speech synthesis and reduce the memory requirements for dictionaries. Several methods were evaluated. Machine-learning-based methods are preferred because they adapt easily to multiple languages. For stress prediction, encouraging results were obtained by combining a decision tree approach with an algorithm that uses global (word-level) statistical data derived from the training dictionary. For syllable boundary prediction, algorithms that learn syllable-level statistics from the training dictionary perform very well and can be implemented as a post-process after prediction of phoneme transcription and stress.
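As an illustration of what learning syllable-level statistics from a training dictionary could look like, here is a minimal sketch. It is not the algorithm evaluated in the paper, which the abstract does not specify. It assumes a unigram model over syllables with add-one smoothing, a dynamic-programming search over segmentations, and the constraint that each syllable contains exactly one vowel. The vowel inventory, the toy dictionary, and the names `train` and `syllabify` are all hypothetical.

```python
import math
from collections import Counter

# Toy vowel inventory (an assumption for this sketch, not from the paper).
VOWELS = {"a", "e", "i", "o", "u"}


def train(dictionary):
    """Count how often each syllable occurs in a syllabified dictionary."""
    counts = Counter()
    for syllables in dictionary:
        counts.update(tuple(s) for s in syllables)
    return counts


def syllabify(phonemes, counts, max_len=5):
    """Pick the segmentation with the highest smoothed unigram log-probability.

    Every candidate syllable must contain exactly one vowel.
    """
    total = sum(counts.values())
    vocab = len(counts) + 1  # one extra slot for unseen syllables
    n = len(phonemes)
    # best[i] = (best score for the first i phonemes, start of last syllable)
    best = [(-math.inf, None)] * (n + 1)
    best[0] = (0.0, None)
    for end in range(1, n + 1):
        for start in range(max(0, end - max_len), end):
            chunk = tuple(phonemes[start:end])
            if sum(p in VOWELS for p in chunk) != 1:
                continue
            if best[start][0] == -math.inf:
                continue
            prob = (counts[chunk] + 1) / (total + vocab)  # add-one smoothing
            score = best[start][0] + math.log(prob)
            if score > best[end][0]:
                best[end] = (score, start)
    if n > 0 and best[n][1] is None:
        return [list(phonemes)]  # no valid segmentation: keep the word whole
    # Walk the back-pointers from the end of the word to recover syllables.
    result, end = [], n
    while end > 0:
        start = best[end][1]
        result.append(list(phonemes[start:end]))
        end = start
    return result[::-1]


# Hypothetical training dictionary: each word is a list of syllables.
TRAIN = [
    [["p", "a"], ["t", "a"]],
    [["t", "a"], ["k", "o"]],
    [["k", "o"], ["p", "a"]],
    [["p", "a", "n"], ["t", "o"]],
]

counts = train(TRAIN)
print(syllabify(["k", "o", "t", "a"], counts))       # [['k', 'o'], ['t', 'a']]
print(syllabify(["p", "a", "n", "t", "a"], counts))  # [['p', 'a', 'n'], ['t', 'a']]
```

Because the function takes a phoneme sequence as input, it can run as a post-process after phoneme transcription, in the spirit of the abstract. A real system would need a language-specific phoneme inventory and probably a richer model than syllable unigrams.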

Similar resources

Automatic word stress marking and syllabification for Catalan TTS

Stress and syllabification are essential attributes for several components in text-to-speech (TTS) systems. They are responsible for improving grapheme-to-phoneme conversion rules and for enhancing the synthetic intelligibility, since stress and syllable are key units in prosody prediction. This paper presents three linguistically rule-based automatic algorithms for Catalan text-to-speech conve...

A Semi-Automatic System for the Syllabification and Stress Assignment

This Master's Thesis concerns research in the automatic analysis of the sub-lexical structure of English words. Sub-lexical structure includes linguistic categories such as syllabification, stress, phonemic representation, phonetics, and spelling. This information could be very useful in all sorts of speech applications, including duration modeling and speech recognition. ANGIE is a system that...

Phonological Processing for Urdu Text to Speech System

Determining and modeling phonological phenomena is necessary to generate speech from textual input. These phenomena include letter-to-sound conversion, syllabification, sound change, stress assignment and intonation assignment. This paper presents work on Urdu phonological processes and provides algorithms to convert textual input into phonologically annotated output, required for Urdu text-to-...

Linguistic-prosodic processing for text-to-speech synthesis in Italian

The linguistic-prosodic processing applied to text-to-speech synthesis in Italian is described. It proceeds in 5 steps: tokenisation and normalisation of abbreviations, numbers, etc.; part-of-speech tagging, based on function words, terminations and contextual heuristics; shallow parsing, based on a chunk grammar; grapheme-to-phoneme conversion, lexical stress assignment and syllabification by ...

Automatic syllabification in English: a comparison of different algorithms.

Automatic syllabification of words is challenging, not least because the syllable is not easy to define precisely. Consequently, no accepted standard algorithm for automatic syllabification exists. There are two broad approaches: rule-based and data-driven. The rule-based method effectively embodies some theoretical position regarding the syllable, whereas the data-driven paradigm tries to infe...

Publication date: 2000